Augmenting with Slot Filler Relevancy Signatures Data
نویسندگان
چکیده
Human readers can reliably identify many relevant texts merely by skimming the texts for domain-specific cues. These quick relevancy judgements require two steps: (1) recognizing an expression that is highly relevant to the given domain, e.g. "were killed" in the domain of terrorism, and (2) verifying that the context surrounding the expression is consistent with the relevancy guidelines for the domain, e.g. "5 soldiers were killed by guerrillas" is not consistent with the terrorism domain since victims of terrorist acts must be civilians 1. The Relevancy Signatures Algorithm attempts to simulate the first step in this process by deriving reliable relevancy cues from a corpus of training texts and using these cues to quickly identify new texts that are highly likely to be relevant. But since this algorithm makes no attempt to look beyond the relevancy cues, it will occasionally misclassify texts when the surrounding context contains additional information that makes the text irrelevant.
منابع مشابه
Overview of UI-CCG Systems for Event Argument Extraction, Entity Discovery and Linking, and Slot Filler Validation
In this paper, we describe the University of Illinois (UI CCG) submission to the 2013 TAC KBP Event Argument Extraction (EAE), English Entity Discovery and Linking (EDL), and Slot Filler Validation (SFV) tasks. We developed three separate systems. Our Event Argument Recognition system infers world knowledge from event argument overlaps to improve performance of a recognition/labeling pipeline. ...
متن کاملUnsupervised Person Slot Filling based on Graph Mining
Slot filling aims to extract the values (slot fillers) of specific attributes (slots types) for a given entity (query) from a largescale corpus. Slot filling remains very challenging over the past seven years. We propose a simple yet effective unsupervised approach to extract slot fillers based on the following two observations: (1) a trigger is usually a salient node relative to the query and ...
متن کاملBIT at TAC 2013 Slot Filler Validation Track
This paper presents the design and implementation of our English slot filling system. The objective of the slot filling task is to extract attribute values of the given entities. We developed a slot filling system which employs a combinative technique of dependency patterns matching and SVM-based supervision approach. Evaluation results show the strength and weakness of our technique.
متن کاملClassifying Texts using Relevancy Signatures
Text processing for complex domains such as terrorism is complicated by the difficulty of being able to reliably distinguish relevant and irrelevant texts. We have discovered a simple and effective filter, the Relevancy Signatures Algorithm, and demonstrated its performance in the domain of terrorist event descriptions. The Relevancy Signatures Algorithm is based on the natural language process...
متن کاملBootstrapping Knowledge Base Acceleration
The Streaming Slot Filler (SSF) task in TREC Knowledge Base Acceleration track involves detecting changes to slot values (relations) over time. To handle this task, the system needs to extract relations to identify slot-filler values and detect novel values. Being the first attempt at KBA, the biggest challenge that we faced was the scale of the data. We present the approach used by University ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992